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Ahstract 


J  The  oscillatory  appearance  of  stationary  time  series  is  captured 


very  economically  by  only  a  few  higher  order  crossings  which  in  addition 


contain  a  great  deal  of  the  spectral  content  of  the  process.  A  useful 


approximation  to  the  variances  of  higher  order  crossings  is  discussed 


and  is  applied  in  the  construction  of  probability  limits  for  the 


hypothesized  higher  order  crossings.  From  this,  a  graphical  display 


of  higher  order  crossings  together  with  their  probability  limits  provide 


a  fast  goodness  of  fit  test.  Examples  illustrate  the  applicability  of  this 


device. 
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1.  Introduction. 

There  has  been  a  growing  interest  in  graphical  methods  in  time  series 
analysis  and  especially  so  since  the  popularization  of  electronic  devices  with 
graphics  capabilities.  In  following  this  trend,  the  present  article  discusses 
a  cetain  zero-crossings  based  graphical  technique  useful  in  testing  for 
goodness  of  fit  of  time  series  models.  The  idea  is  to  use  plots  of  higher 
order  crossings  which  are  akin  to  plots  of  the  correlogram  and  spectal  densities 
or  the  periodogram,  but  with  the  advantage  of  great  simplicity.  Under  the 
Gaussian  assumption,  the  sequence  of  expected  higher  order  corssings  is  equivalent 
to  the  autocorrelation  function  and  hence  to  the  normalized  spectral  distribution 
function,  but  it  summarizes  the  data  differently.  In  this  regard,  the  monotone 
property  of  higher  crossings  plays  an  instrumental  role  for  the  initial  rate  of 
increase  exhibited  by  higher  crossings  proves  to  be  an  effective  summary  feature. 
As  the  higher  crossings  continue  to  increase  their  rate  loses  its  discrimination 
potency  and  different  processes  seem  to  share  similar  rates.  This  is  why  in 
general  very  few  higher  crossings  are  used  in  testing  goodness  of  fit. 

The  present  paper  gives  an  overview  of  our  previous  work  particularly 
Kedera  and  Reed  (1985)  and  Kedem  (1985)  to  which  the  reader  is  referred  for 
mathematical  details  and  more  examples. 


\  Ac" 


3 


2.  Plots  of  Higher  Order  Crossings. 

Let  {zt},  t  =  0,  ±1,...,  be  a  zero  mean  stationary  Gaussian  process  with 
correlation  function  p^.  and  normalized  spectral  distribution  function  F, 
and  let  V  be  the  difference  operator,  y -  z  It  is  convenient 

to  introduce  the  clipped  binary  process 


x 


(k) 

t 


1,  Vk_1z  >  0 
’  t 

0,  otherwise 


k  =  1,2,... 


which  gives  rise  to  the  indicator  at  time  t 


,  1,  *<k>  *  x<k> 

,  (k)  t  t-1 


0,  otherwise. 


The  higher  order  crossings  of  order 


k,  D 

. .+ 


k,n  ’ 
d(k) 


n 


is  defined  by 


It  is  seen  that  D.  counts  the  number  of  axis-crossings  in  the  (k-l)’th 
k ,  n 

k-1  k-1 

differenced  series  y  z,,...,y  z  .  D,  then  is  the  usual  number  of 

1  n  l,n 

zero-  or  axis-crossings  by  the  original  series  z,,...,z  . 

1  n 

From  the  point  of  view  of  the  theory  of  stationary  Gaussian  processes,  the 
sequence  of  higher  order  crossings  is  equivalent  to  the  correlation  and  spectral 
structures.  This  is  stated  precisely  in 

Theorem  1.  Let  { z ^ }  be  a  zero  mean  stationary  Gaussian  process  with  correlation 

function  .  Then  the  sequence  f }  is  completely  determined  from  the  sequence 

{E(D.  )}.  That  is,  p,  is  determined  by  E(D  ),..., E(D.  ). 

k  I ,  n  k  }ti 


Proof;  From  Kedem  and  Slud  (1981), 


Cos^ 


*<Dlc+l.n> 


,2k  v.  r /2k.  ,2k  .  _.k 

_(k-l)+PLf  (  k  )+(k-2}  ‘  •+('1>  ^k+1 


n-1 


(i) 


and  the  p,  can  be  determined  recursively  from  the  E(D.  ). 

j  k,n' 


□ 


Obviously  it  is  also  true,  from  (1),  that  knowledge  of  {p^  }  is  equivalent 


to  knowledge  of  the  sequence  {E(D.  )  }.  It  follows  that  F  is  completely 

J  • n 


determined  by  the  sequence  of  expected  higher  order  crossings.  This  is 
summarized  by  the  symbolism 


Thus,  exactly  for  the  same  reasons  that  plots  of  and  F  are  extensively 

used  in  time  series  analysis,  it  is  useful  to  observe  plots  of  higher  order 
crossings  too. 

The  main  thing  to  observe  in  plots  of  higher  order  corssings  is  the  rate 

at  which  they  incrfe&se  and  the  starting  point  D..  .  The  fact  that  higher 

1  ,n 

order  crossings  tend  to  increase  can  be  attributed  to  the  general  fact  that 


D. 

J  »n 


£  D 

j+l,n 


+  1 


with  probability  one.  Hence  the  D.  tend  to  increase  with  j  for  fixed 

J.n 

but  large  n.  See  also  Kedem  and  Slud  (1981). 

It  is  instructive  to  observe  plots  of  higher  order  crossings  and  thus 


motivate  the  central  idea  of  this  paper.  Figure  1  displays  plots  of  ten 


obtained  from  first  order 


higher  order  crossings  D1>1000,  •  •  •  ’D10,1000’ 
autoregressive  processes  with  different  parameter  values  <j>.  It  is  seen  that 
the  initial  rate  of  increase  and  starting  point  differ  from  process  to  process, 
but  that  as  the  order  increases  the  rate  is  almost  independent  of  the  parameter. 
This  same  behavior  has  been  observed  in  numerous  cases  which  may  be  interpreted 
to  mean  that  only  the  very  first  few  higher  crossings  carry  sufficient 
information  which  discriminates  clearly  between  different  processes. 


Accordingly , 


it  is  suggested  that  plots  with  as  few  as  six 


D. 

J  »n 


can  be  useful  in  goodness  of  fit  testing.  At  the  same  time  it  should  be  noted 


that  higher  order  crossings  of  high  order  carry  information  too  but  this 


information  is  less  amenable  and  will  not  be  used  here. 


Figure  1.  Plots  of  10q0*  j=l,...,10,  from  z t  =  +  ut>  ut  are 

independent  N(0,1)  random  variables  and  <\>  =  0.75,0.5,0.25,0,-0.25,-0.5,-0.75. 
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3.  The  variance  of  higher  order  crossings. 

The  probability  distribution  of  the  ^  is  quite  intractable  and  we 

shall  concentrate  on  the  more  modest  problem  of  approximating  the  variance  of 

higher  order  crossings  needed  for  the  proposed  goodness  of  fit  test. 

In  general,  the  variance  of  D.  is  a  function  of  the  fourth  order 

J  >n 

cumulant  function  K^~^(r,s,t)  of  {X^}  which  is  summable  under  appropriate 
moment  conditions.  Thus  for  j  =  1  the  following  asymptotic  result  was  proved 
in  Kedem  (1980) . 


Theorem  2.  If  p.  is  absolutely  summable  then 


l  l^u.-ka-ioi  <  - 

k=-« 


and 


D  -  E(D  )  L  ? 

— *- - - 1 —  •+  N(0,  a  )  ,  n  -+  co, 

^  1 


where 


CO 

=  ---  l  Ksin  1Pk)2  +  sin  ^P^sin  1pk+1  +  (l,-k,l-k)] 


Tt  k=-°° 


The  same  result  applies  to  every  ^  provided  the  correlation  functions  of 


{y^  ^zt)  decays  to  zero  fast.  However  is  not  known  in  general  which  makes 


the  above  result  impractical. 

Another  approach  is  to  hold  n  fixed  and  let  j  increase.  In  this  case 
it  is  possible  to  obtain  a  useful  asymptotic  result  under  the  assymption  of 
m-dependence .  Assume  that  tt  is  a  point  of  increase  for  F  and  let 


(k) 


V  (x_ 
r  t 


(k). 


Then  X.  -*■  0  as  k  -*■  «>  and  it  was  shown  by  Kedem  and  Reed  (1985)  that 


„„„  -00  .(k)  _  ,.(k), 

cov  (d  ,d  )  =  o  (  ) . 


(2) 


(k) 

The  proof  of  this  fact  depends  on  the  differential  properties  of  p  ,  the 
correlation  function  of  {^z  >.  (2)  readily  yields. 


Theorem  3.  Let  (z  }  be  an  m-dependent  stationary  Gaussian  process  and  assume 
that  it  is  a  point  in  the  support  of  F.  Then  for  fixed  n 


lim 

k-*» 


Var  (D.  ) 

_ k ,  n 


(n-l)Ajk)(l-A1(k)) 


=  1. 


This  result  was  used  in  the  construction  of  probability  limits  for  the  higher 

crossings  under  the  hypothesis  of  white  noise.  However  the  assumption  of 

m-dependence  cannot  always  be  verified  and  another  approximation  is  called  for. 

A  rather  close  approximation  to  the  variance  of  D.  can  be  provided  if 

j.n 

(k) 

it  is  assumed  that  the  binary  sequence  (d  }  is  a  Markov  chain.  This  first 
order  approximation  has  been  found  very  satisfactory  by  an  extensive  simulation 
Define  the  two  parameters  associated  with  the  chain, 


(k)  (k)  <k> 

■  =  1  -  Aj  ,  <3 


A 


(k) 


1-2A (k)  +  A(k) 
(k)  _  Xl__  _  A2 

2 ( 1 -> [k)) 


When  the  proi ess  is  a  stationary  Gaussian  autoregressive-moving  average  process 

(k)  (k) 

with  known  (or  hypothesized)  parameters,  p  and  v'  are  known  too 


(k ) 

expl i c ity ly .  Then  if  Id  ^  is  a  Markov  chain  it  can  be  shown  (Kedem  (1985)) 


Va  r(D  )  =  (n-l)p^q^  + 
k,  n 


(k)  (k)  (k)  (k) 

' - [  (n-1 )  -V  ] 

1_v)(k>  k,n 


(3) 


where 

V 


k,„  -  q<t,Il-(^y-),’-1)/(l-«<k)). 


This  approximation  has  been  compared  (Kedem  (1985))  with  actual  estimates 

obtained  from  100  independent  realizations  each  of  length  n  =  1000.  The 

results  are  given  in  Table  I.  Although  E (D^  ^qqq)  are  known  explicitly  when 

the  parameters  are  known,  these  expectations  are  estimated  too  as  a  check 

of  the  whole  simulation.  It  is  seen  that  (3)  agrees  well  with  the  simulation 

(k)  (k) 

results.  An  algorithm  for  obtaining  p  ,  v  is  given  in  Kedem  (1985). 
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Series 


White 

Noise 


AR(2) 

^  =  0.4 

'{>,  =  -0.7 


ARMA (1,1) 
{,  =  0.5 
9  =  0.7 


ARMA (2,2) 
=  -1'4 

w2  =  -0.5 

0,  =  0.2 


1  E(Dj,1000)  E(DJ,1000) 

From  100 
Realizations 


1 

500 

497 

2 

666 

666 

3 

732 

732 

4 

769 

770 

5 

794 

795 

6 

813 

814 

1 

424 

425 

2 

484 

485 

3 

536 

537 

4 

594 

594 

5 

651 

652 

6 

702 

701 

1 

552 

552 

2 

679 

6  79 

3 

737 

737 

4 

773 

772 

5 

797 

797 

6 

814 

814 

1 

884 

883 

2 

89  7 

897 

3 

903 

90  3 

4 

908 

908 

Lj 

911 

911 

6 

914 

914 

1 

{Var(Dj  ,1000^  *Var(Dj,  1000) 


From  (3) 

From  190 
Realizations 

15.81 

15.96 

13.15 

13.63 

12.16 

12.53 

11.57 

11.49 

11.18 

11.05 

10.82 

10.00 

9.64 

9.67 

9.38 

9.13 

10.29 

10.81 

11.27 

12.72 

11.87 

12.02 

12.04 

11.34 

14.62 

14.  74 

12.96 

12.87 

12.09 

12.05 

11.27 

11.52 

10.  70 

1  1.12 

10.15 

10.80 

10.04 

10.51 

9.20 

9.53 

8.84 

9.01 

8.60 

8.50 

m 

vf 

00 

8.47 

8.29 

8.38 

r)2  =  0.1 


Table_J.  Comparison  of  (3)  witti  the  variance  obtained  from  100  independent 

realizations  of  size  1000.  E  (D .  and  E  (I) .  are  rounded  to  the 

j , 1 000  I  , 1 000 


nearest  integer. 
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4.  A  Graphical  Goodness  of  Fit  Criterion. 

The  proposed  goodness  of  fit  test  is  based  on  deviations  of  the  observed 

path  of  higher  crossing  from  the  expected  path  where  the  later  is  obtained 

under  the  hypothesis  of  an  assumed  model.  Marked  deviations  of  the  observed 

path  from  the  expected  one  suggest  that  the  observed  process  does  not 

osc il late  as  expected.  The  closeness  of  the  two  paths  can  be  measured  by 

appealing  to  (3)  and  to  conditions  under  which  the  D.  are  asymptotically 

3  > n 

normal.  It  can  be  shown,  using  the  technique  in  Cuzick  (1976)  that  when 

{z  }  is  Gaussian  the  condition  1\ P  I  <  00  implies  the  asymptotic  normality 
t  K 

of  the  D  .  It  follows  that  approximate  95%  probability  limits  for  D 

k  j  n  K ,  n 

are  for  each  k  and  sufficiently  large  n 


(n-l)p(k)  +  1.96(Var  D.  ll/2  (4) 

k,n 

where  Var(D,  )  is  given  by  (3).  When  at  least  one  observed  D.  ,j=l,...,6, 
k  , n  j  ,n 

lies  outside  the  limits  (4)  the  assumed  model  under  which  (4)  was  derived  is 
rejected.  before  discussing  the  power  of  this  test  it  is  illustrated  by  a 
few  examples. 


4.1  Examnl es . 

Annual _Mea_n  Temp e r atur e 

The  graph  of  the  annual  mean  air  temperature  from  1781  to  1980  at 
llohenpeissenberg ,  Germany,  is  given  in  Figure  2.  Actually  the  observations  for 
1811  and  1812  are  missing  and  were  replaced  by  the  mean  of  neighboring 
observations.  This  has  only  a  very  small  effect  on  the  sequence  of  higher 


c ross i ngs . 


Figure _2.  Annual  Temperature  Series.  n  =  195. 

(Source:  Report  It  155  of  the  Deutschen  Wetterdienstes ,  West  Cermay  (1981).) 

Since  annual  temperature  is  hard  to  predict,  we  could  ask  the  following 
question: 

Does  the  series  oscillate  as  white  noise?  The  answer  is  obtained  from  Figure  3 
where  it  is  seen  that  the  higher  order  crossings  are  well  within  the  bounds  (4) 
so  that  at  least  in  this  sense  the  series  resembles  white  noise.  For  comparison, 
the  figure  portrays  the  higher  crossings  of  simulated  white  noise  which  fall 
within  the  bounds  too  as  expected. 


I  i  gurr  i . 
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ARM A  Mod  els . 

Figure  4  shows  the  probability  limits  (4)  under  various  hypotheses. 

These  are  white  noise,  second  order  autoregressive  process  with  parameters 

0.4  and  -0.7,  and  second  order  autoregressive  moving  average  process  with 

parameters  =  (-1.4,  -0.5),  0  =  (0.2,  0.1).  The  actual  D.  _  were 

~  j ,45U 

obtained  from  simulated  data  given  in  an  appendix  in  Priestly  (1981).  The 
three  paths  fall  well  within  their  respective  limits  and  the  corresponding 
hypothesises  are  accetped. 


figure  4.  Sample  higher  order  crossings  paths  fall  within  their  respective  limits. 

It  is  seen  that  the  three  processes  display  different  oscillation  patterns 
which  are  captured  very  economically  by  only  six  higher  order  crossings  where  the 
ARMA  (2,2)  process  is  most  oscillatory  while  the  AR(2)  is  much  smoother. 

Signal  Detection. 

Figure  5  displays  two  series  which  appear  to  be  very  similar  except 
perhaps  for  scale.  However  their  higher  order  crossings  quickly  reveal  that 
the  first  one  oscillates  as  white  noise  while  the  other  oscillates  roughly 


ns  a  low  order  autoregressive  process. 


This  is  illustrated  in  Figure  6. 
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Figure  b.  The  higher  order  crossings  paths  of  series  (a),  (b) .  The  first 
path  is  within  white  noise  bounds. 

Diagonstic  Check. 

In  testing  the  goodness  of  fit  of  a  model  one  runs  a  residual  analysis 
which  usually  tests  whether  the  residual  series  constitutes  white  noise. 
Consider  series  A,  D  in  Box  and  Jenkins  (1976).  The  fitted  models  there 
(p.  293)  are 

series  A:  Vz^  =  u^  -  0.7u^  ^ 

series  D:  z  0.87z  ,  =  1.17  +  u 

t  t-1  t 

where  {u  }  is  the  residual  series.  Figure  7  however  reveals  that  the  two 

residual  series  are  not  quite  white  as  is  signified  by  the  axis-crossings 

themselves  winch  re  outside  the  limits  (4).  It  is  interesting  to  note 

though  that  the  rest  of  the  higher  order  crossings  behave  as  those  of  white 

noise.  Thus,  cm  opt  for  smaller  I)  ,  the  two  residual  series  oscillates 

i  ,  n 


as  wh  i  t«.  no  i  se  . 


Figure  7.  Diagnostic  check  applied  to  the  residuals  of  series  A  (n=177) 

and  series  D  (n-290) .  D,  is  outside  the  limits  (4)  for  white  noise. 

l»n 


4.2  Power  SimtO atinn . 

The  limits  (4)  provide  approximate  95%  bounds  for  each  D 


j.n' 


However 


our  test  is  based  on  D,  , ...,D,  simultaneously  and  the  hypothesized 

1 ,n  b , n 


model  is  rejected  if  at  least  one  ^  falls  outside  the  probability 


bounds.  It  is  expected  that  a  test  which  is  based  on  more  than  a  single 


D,  has  a  higher  probability  of  rejecting  a  true  hypothesis  than  0.05 
j ,  n 


and  in  fact  our  experience  indicates  that  with  six  D.  this  probability 

J." 


is  about  0.1.  The  exact  probability  is  still  an  open  problem  at  present. 

An  indication  of  the  power  is  provided  in  Table  II  which  gives  the  power 
for  tesing  the  hypothesis  of  white  noise  where  the  alternative  is  the  indicated 
process.  The  power  is  estimated  from  50  independent  series  each  of  size 


450.  Similar  results  were  obtained  for  greater  series  lengths. 


Process 


Power 


White  Noise  .10 

AR(1),  <$>  =  .05  .26 

MA (1) ,  0  =  .1  .40 

AR  (1)  ,  cf)  =  .2  .90 

AR(1) ,  4-  =  .5  1.00 

AR(2),  4  =  .1,  4>2  =  -.15  .88 

ARMA (1 , 1)  ,  4)^^  =  .1,  01  =  -.1  .86 

ARMA (2,2),  <p7  =  .1,  =  -.4  1.00 

ex  =  o,  e2  =  .3 

ARMA (2.2) ,  41  =  .1,  *  =  -.2  .88 


Table  II.  Power  sinulation  for  testing  white  noise  versus  the  indicated 
process . 
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